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DETAILED ACTION 

1 . Claims 1 -20 have been examined. 

Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: Change in Power of Attorney as received on 10/12/2004 and 
RCE/IDS/Amendment as received on 2/22/2005. 

Amendment Format Comments 

3. It is asked that when applicant is deleting claim text having 5 characters or less, if the 
strike-through is difficult to perceive, double brackets should and must be used. For instance, in 
claim 1, the deletion of the letter "a" is difficult to perceive, and consequently double brackets 
must be used. Other examples would include deleting the letter "e" and number "4". Such 
precautions would assure prevention of non-complaint notices in the future. 

Claim Rejections - 35 USC § 112 

4. The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

5. Claims 1-20 are rejected under 35 U.S.C. 112, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. 
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6. Regarding claims 1, 6, and 11, applicant has not ruled out that P cannot equal 0. If P=0, 
then the claim is unclear as it is not clear how the system could fetch from 0 streams. Also, if 
applicant did intend to include P=0, then the claim also has enablement issues. 

7. Regarding claims 1 and 6, it is not clear how a fetch algorithm can be coupled to 
predictors (hardware). That is, an algorithm is merely a set of steps which is to be performed. 
However, a set of steps cannot be coupled to hardware. 

Claim Rejections - 35 JJSC § 102 

8. The following is a quotation of the appropriate paragraphs of 35 U.S. C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

9. Claims 1-14 and 18-19 are rejected under 35 U.S.C. 102(b) as being anticipated by Yoaz 
et al., "Speculative Techniques for Improving Load Related Scheduling," May 1999 (as applied 
in the previous Office Action and herein referred to as Yoaz). In addition, Parady, U.S. Patent 
No. 5,933,627 (as applied in the previous Office Action and herein referred to as Parady) is cited 
as extrinsic evidence for showing that it is common to have separate hardware streams for each 
thread. 

10. Referring to claim 1, Yoaz has taught in a processor having multiple hardware streams 
supporting multiple data threads, and a data cache, a system for fetching instructions from up to 
P of the multiple hardware streams to a pipeline, where P is less than the number of multiple 
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hardware streams (for purposes of this examination, P is interpreted as being equal to 1), the 
system comprising: 

a) a fetch stage, for simultaneously fetching every cycle, the instructions from up to the P of the 
multiple hardware streams. See page 49, section 3.1, and note that 6 uops may be fetched per 
cycle (and uops correspond to instructions). For instance, a store instruction comprises 2 uops 
(see first paragraph of "prior work" section on page 43), and so 6 uops would correspond to three 
simultaneously fetched store instructions. 

b) multiple hit/miss predictors, each associated with a corresponding one of the multiple 
hardware streams, said each configured to forecast, at the fetch stage, whether corresponding 
instructions from said corresponding one of the multiple hardware streams will hit or miss the 
data cache. See page 47, column 1, and note the paragraph beginning with "Another. . .". 
Clearly, multiple threads exist, and a predictor is used for each thread. Consequently, there are 
multiple predictors. Also, the forecasts of these predictors would be used by the fetching 
hardware (if predicted to hit, continue fetching from same stream; otherwise, switch and fetch 
from another stream). As a result, the forecasts occur at the fetch stage. 

c) a fetch algorithm, coupled to said multiple hit/miss predictors, configured to select, on a cycle- 
by-cycle basis, the P of the multiple hardware streams from which to fetch the instructions. 
Again, see page 47, column 1, and note the paragraph beginning with "Another. . .". The fetch 
algorithm includes the steps of fetching from a current thread as long as a miss is not predicted, 
and when a miss is predicted, switching to and fetching instructions from a second thread. It 
should be realized that this happens on a cycle-by-cycle basis, as instructions are executed every 
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cycle. Whenever a load instruction appears, the prediction must be made, and if multiple loads 
occur in a row then the predictions are made for those consecutive cycles. 

1 1 . Referring to claim 2, Yoaz has taught a system as described in claim 1 . Yoaz has further 
taught that a hit prediction precipitates no change in the fetching of the instructions. See page 
47, column 1, and note the paragraph beginning with "Another. . .". It should be realized that 
Yoaz only discloses thread switching based on a predicted cache miss because that is when an 
expensive main memory access is likely required. When there is a hit in the cache, no main 
memory access is required and therefore, no switch needs to be made. 

12. Referring to claim 3, Yoaz has taught a system as described in claim 1 . Yoaz has further 
taught that a miss prediction results in switching the fetching to different ones of the multiple 
hardware streams. See page 47, column 1, and note the paragraph beginning with "Another. 
When a miss prediction is made, a thread switch occurs. That is, the system will begin fetching a 
new thread instead of sitting idle while the current thread finishes the time-consuming main 
memory access. A first switch may result in switching to a stream B, while another switch may 
result in switching to a stream C (different ones). Each stream is a flow of data from a source to 
a sink. That is from a first thread to the execution units is a first stream, a second thread to the 
execution units is a second stream, and so on. A hardware stream is nothing more than the 
hardware which supports this flow of instructions. In addition, Parady has shown that multiple 
hardware streams exist for multiple threads. See Fig.3, and note that the wires coupling the 
dispatch unit and each of the instruction buffers forms a multiple hardware streams, one for each 
thread, in which thread instructions may flow. 



Application/Control Number: 09/595,776 Page 6 

Art Unit: 2183 

13. Referring to claim 4, Yoaz has taught a system as described in claim 1 . Yoaz has further 
taught that said each of said multiple hit/miss predictors generates a confidence level value, and 
said confidence level value is used by said fetch algorithm to select the P of the one of the 
multiple hardware streams. See page 47, column 1 , and note the paragraph beginning with 
"Another. . .". Clearly, the predictors will predict a miss to occur or not to occur. This prediction 
is a confidence level value in that if a miss is predicted to occur, then the predictor is confident 
that the current thread will miss the cache, whereas if a miss is not predicted to occur, then the 
predictor is confident that the current thread will hit the cache. These confidence values are used 
to determine whether fetching will continue from the current thread or whether a thread switch 
occurs and fetching will begin from a new thread. 

14. Referring to claim 5, Yoaz has taught a system as described in claim 1. Yoaz has further 
taught that a said multiple hit/miss predictors further operate at a dispatch level to optimize the 
dispatch of consumer instructions by predicting latency of data. See page 47, column 1, and note 
the paragraph beginning with "Another. . .". It should be noted that the predictions determine 
which thread will be fetched from. Consequently, the predictors also determine which threads 
will be dispatched, as instructions which are fetched are also dispatched, otherwise they cannot 
be executed. And, it is known that consumer instructions are instructions which merely access 
some register or memory location to perform the desired operation, and therefore, it is expected 
that consumer instructions will be dispatched. In addition, the predictors predict the latency of 
data in that if a miss is predicted, then it is predicted that it will take a long latency to retrieve the 
data (since main memory is accessed). However, if a miss is not predicted, then it is predicted 
that a short latency is required as the data will be available in the cache. 
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15. Referring to claim 6, Yoaz has taught a processor having multiple hardware streams 
supporting multiple data threads, the processor comprising: 

a) a fetch stage, for simultaneously fetching every cycle, instructions from up to P of the multiple 
hardware streams, wherein P is less than the number of the multiple hardware streams. From 
page 47 (paragraph starting with "Another. . ."), let variable X represent the total number of 
possible threads. At the very least, P=l (P<X) and one thread is executed at a time. Also, see 
page 49, section 3.1, and note that 6 uops may be fetched per cycle (and uops correspond to 
instructions). For instance, a store instruction comprises 2 uops (see first paragraph of "prior 
work" section on page 43), and so 6 uops would correspond to three simultaneously fetched store 
instructions. 

a) a data cache, comprising a plurality of levels. See page 47, column 1, Fig.3, and the 
paragraphs beginning with "Employing. . ." and "Another. . .". It is clear that there is an LI and 
L2 cache. 

b) multiple hit/miss predictors, each associated with a corresponding one of the multiple 
hardware streams, said each configured to forecast whether corresponding instructions from said 
corresponding one of the multiple hardware streams will hit or miss said data cache (see page 47, 
column 1, and note the paragraph beginning with "Another. . .". Clearly, multiple threads exist, 
and a predictor is used for each thread. Consequently, there are multiple predictors. Also, each 
stream is a flow of data from a source to a sink. That is, from a first thread to the execution units 
is a first stream, a second thread to the execution units is a second stream, and so on. A 
hardware stream is nothing more than the hardware which supports this flow of instructions 
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(again, see Parady as described in the rejection of claim 1), said each of said multiple hit/miss 

predictors comprising: 

bl) a plurality of hit/miss predictors, each configured to forecast whether said 
corresponding instructions from said corresponding one of the multiple hardware streams 
will hit or miss one or more of said levels. See page 47, column 1, and note the 
paragraph beginning with "Another. . .". Note that it is predicted if the thread will miss 
the L2 cache. 

b2) a fetch algorithm, coupled to said multiple hit/miss predictors, configured to select, 
on a cycle-by-cycle basis, said P of the multiple hardware streams from which to fetch 
said instructions, wherein said fetch algorithm selects said P of the multiple hardware 
streams based upon whether said corresponding instructions from said corresponding one 
of the multiple hardware streams will hit or miss said one or more of said levels. Again, 
see page 47, column 1, and note the paragraph beginning with "Another. . .". The fetch 
algorithm includes the steps of fetching from a current thread as long as a miss is not 
predicted, and when a miss is predicted, switching to and fetching instructions from a 
second thread. It should be realized that this happens on a cycle-by-cycle basis, as 
instructions are executed every cycle. Whenever a load instruction appears, the 
prediction must be made, and if multiple loads occur in a row then the predictions are 
made for those consecutive cycles. 
16. Referring to claim 7, Yoaz has taught a processor as described in claim 6. Yoaz has 
further taught that a hit prediction precipitates no change in the fetching of said instructions. See 
page 47, column 1, and note the paragraph beginning with "Another. . .". It should be realized 
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that Yoaz only discloses thread switching based on a predicted cache miss because that is when 
an expensive main memory access is likely required. When there is a hit in the cache, no main 
memory access is required and therefore, no switch needs to be made. 

17. Referring to claim 8, Yoaz has taught a processor as described in claim 8. Yoaz has 
further taught that a miss prediction results in switching the fetching to different ones of the 
multiple hardware streams. See page 47, column 1, and note the paragraph beginning with 
"Another. . .". When a miss prediction is made, a thread switch occurs. A first switch may result 
in switching to a stream B, while another switch may result in switching to a stream C (different 
ones). That is, the system will begin fetching a new thread instead of sitting idle while the 
current thread finishes the time-consuming main memory access. Each stream is a flow of data 
from a source to a sink. That is from a first thread to the execution units is a first stream, a 
second thread to the execution units is a second stream, and so on. A hardware stream is nothing 
more than the hardware which supports this flow of instructions. 

18. Referring to claim 9, Yoaz has taught a processor as described in claim 6. Yoaz has 
further taught that said each of said multiple hit/miss predictors generates a confidence level 
value, and said confidence level value is used by said fetch algorithm to select said P of the 
multiple hardware streams. See page 47, column 1, and note the paragraph beginning with 
"Another. . .". Clearly, the predictors will predict a miss to occur or not to occur. This prediction 
is a confidence level value in that if a miss is predicted to occur, then the predictor is confident 
that the current thread will miss the cache, whereas if a miss is not predicted to occur, then the 
predictor is confident that the current thread will hit the cache. These confidence values are used 
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to determine whether fetching will continue from the current thread or whether a thread switch 
occurs and fetching will begin from a new thread. 

19. Referring to claim 10, Yoaz has taught a system as described in claim 6. Yoaz has 
further taught that a said multiple hit/miss predictors further operate at a dispatch level to 
optimize the dispatch of consumer instructions by predicting latency of data. See page 47, 
column 1, and note the paragraph beginning with "Another. . .". It should be noted that the 
predictions determine which thread will be fetched from. Consequently, the predictors also 
determine which threads will be dispatched, as instructions which are fetched are also 
dispatched, otherwise they cannot be executed. And, it is known that consumer instructions are 
instructions which merely access some register or memory location to perform the desired 
operation, and therefore, it is expected that consumer instructions will be dispatched. In 
addition, the predictors predict the latency of data in that if a miss is predicted, then it is 
predicted that it will take a long latency to retrieve the data (since main memory is accessed). 
However, if a miss is not predicted, then it is predicted that a short latency is required as the data 
will be available in the cache. 

20. Referring to claim 11, Yoaz has taught in a processor having multiple hardware streams 
supporting multiple data threads, and a data cache, a method for simultaneously fetching 
instructions every cycle from up to P of the multiple hardware streams to a pipeline, where P is 
less than the number of the multiple hardware streams (note that P=l, and P<X, where X is all of 
the available streams from which instructions may be fetched), the method comprising: 

a) for each of the multiple hardware streams, making a hit/miss prediction by a corresponding 
one of associated hit/miss predictors as to whether corresponding instructions for the each of the 
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multiple hardware streams previously fetched will hit or miss the cache. See page 47, column 1, 
and note the paragraph beginning with "Another. . .". Clearly, multiple threads exist, and a 
predictor is used for each thread. Consequently, there are multiple predictors, 
b) selecting, on a cycle-by-cycle basis, the P of the multiple hardware streams from which to 
fetch the instructions. Again, see page 47, column 1, and note the paragraph beginning with 
"Another. . .". The method includes the steps of fetching from a current thread (P=l) as long as a 
miss is not predicted, and when a miss is predicted, switching to and fetching instructions from a 
second thread. It should be realized that this happens on a cycle-by-cycle basis, as instructions 
are executed every cycle. Whenever a load instruction appears, the prediction must be made, and 
if multiple loads occur in a row then the predictions are made for those consecutive cycles. 

21 . Referring to claim 12, Yoaz has taught a method as described in claim 1 1 . Yoaz has 
further taught that said making comprises generating a confidence level value, and employing the 
confidence level value to select the P of the multiple hardware streams. See page 47, column 1, 
and note the paragraph beginning with "Another. . .". Clearly, the predictors will predict a miss 
to occur or not to occur. This prediction is a confidence level value in that if a miss is predicted 
to occur, then the predictor is confident that the current thread will miss the cache, whereas if a 
miss is not predicted to occur, then the predictor is confident that the current thread will hit the 
cache. These confidence values are used to determine whether fetching will continue from the 
current thread or whether a thread switch occurs and fetching will begin from a new thread. 

22. Referring to claim 13, Yoaz has taught a method as described in claim 1 1 . Yoaz has 
further taught further operating the multiple hit/miss predictors at a dispatch level to optimize the 
dispatch of consumer instructions by predicting latency of data. See page 47, column 1, and note 
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the paragraph beginning with "Another. . .". It should be noted that the predictions determine 
which thread will be fetched from. Consequently, the predictors also determine which threads 
will be dispatched, as instructions which are fetched are also dispatched, otherwise they cannot 
be executed. And, it is known that consumer instructions are instructions which merely access 
some register or memory location to perform the desired operation, and therefore, it is expected 
that consumer instructions will be dispatched. In addition, the predictors predict the latency of 
data in that if a miss is predicted, then it is predicted that it will take a long latency to retrieve the 
data (since main memory is accessed). However, if a miss is not predicted, then it is predicted 
that a short latency is required as the data will be available in the cache. 

23. Referring to claim 14, Yoaz has taught a method as described in claim 1 1 . Yoaz has not 
explicitly taught that the processor comprises a fine-grained mult i streaming processor that 
concurrently executes the instructions from the multiple hardware streams. However, Official 
Notice is taken that fine-grained multithreading (FMT) is well known and expected in the art. 
FMT allows expedites the completion of all of the threads being worked on, as every cycle, a 
next thread is switched in. Consequently, overall throughput is increased. And, it should be 
realized that Yoaz has taught that the prediction method will work for multithreaded systems, 
without explicitly saying what types (FMT, CMT, etc.). A person of ordinary skill in the art 
would have recognized that even with FMT, load instructions will be executed, and therefore, 
this prediction scheme will be useful in avoiding switching back to the thread that is predicted to 
miss the cache. As a result, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify Yoaz to be a fine-grained multistreaming system so that the 
advantages of an FMT system may be achieved. 
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24. Referring to claim 1 8, Yoaz has taught a method as described in claim 1 1 . Yoaz has 
further taught that said selecting comprises switching the fetching to a different P of the multiple 
hardware streams. See page 47, column 1, and note the paragraph beginning with "Another. 

25. Referring to claim 19, Yoaz has taught a method as described in claim 11. Yoaz has 
further taught that the data cache comprises a first level and a second level (see page 47, column 
1, Fig. 3, and the paragraphs beginning with "Employing. . and "Another. . .". It is clear that 
there is an LI and L2 cache.), and wherein said making comprises: 

a) first forecasting whether said corresponding instructions from the corresponding one of the 
multiple hardware streams will hit or miss the first level. See page 47, column 1, and note the 
paragraph beginning with "Another. . .". Note that if a miss is predicted at the second level of 
cache (L2), then a miss is inherently predicted at level one (LI). This is inherent because the 
nature of cache accessing forces it to be. More specifically, if there is no miss at level one, there 
is no need to access level two. Therefore, in order for a miss to matter at level two, there must 
have first been a miss at level one. 

b) second forecasting whether the corresponding instructions from the corresponding one of the 
multiple hardware streams will hit or miss the second level. See page 47, column 1, and note the 
paragraph beginning with "Another. . .". 

c) wherein said selecting comprises choosing the P of the multiple hardware streams based upon 
whether the corresponding instructions from the corresponding one of the multiple hardware 
streams will hit or miss the second level. Again, see page 47, column 1, and note the paragraph 
beginning with "Another. . .". The fetch algorithm includes the steps of fetching from a current 
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thread as long as a miss is not predicted, and when a miss is predicted, switching to and fetching 
instructions from a second thread. 

Claim Rejections - 35 USC §103 

26. The following is a quotation of 35 U.S. C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

27. Claim 15 is rejected under 35 U.S.C. 103(a) as being unpatentable over Yoaz, as applied 
above. 

28. Referring to claim 1 5, Yoaz has taught a system as described in claim 1 . Yoaz has 
further taught that the data cache comprises a first level and a second level (see page 47, column 
1, Fig. 3, and the paragraphs beginning with "Employing. . ." and "Another. . .". It is clear that 
there is an LI and L2 cache.), and wherein said each of said multiple hit/miss predictors 
comprises: 

a) a hit/miss predictor, configured to forecast whether said corresponding instructions from said 
corresponding one of the multiple hardware streams will hit or miss said first level. See page 47, 
column 1, and note the paragraph beginning with "Another. . .". Note that if a miss is predicted 
at the second level of cache (L2), then a miss is inherently predicted at level one (LI). This is 
inherent because the nature of cache accessing forces it to be. More specifically, if there is no 
miss at level one, there is no need to access level two. Therefore, in order for a miss to matter at 
level two, there must have first been a miss at level one. 
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b) a hit/miss predictor, configured to forecast whether said corresponding instructions from said 
corresponding one of the multiple hardware streams will hit or miss said second level. See page 
47, column 1, and note the paragraph beginning with "Another. . .". 

c) Yoaz has not explicitly taught separate first and second predictors for predicting first and 
second level cache misses, respectively. However, as shown in Nerwin v. Erlichman 168 USPQ 
177 (1969), to make separable is generally not given patentable weight or would have been an 
obvious improvement. Doing would allow a user to customize each predictor, separately. 
Consequently, it would have been obvious to one of ordinary skill in the art at the time of the 
invention to modify Yoaz's hit/miss predictor into first and second predictors for predicting first 
and second level cache misses, respectively. 

d) wherein said fetch algorithm selects the P of the multiple hardware streams based upon 
whether said corresponding instructions from said corresponding one of the multiple hardware 
streams will hit or miss said second level. Again, see page 47, column 1, and note the paragraph 
beginning with "Another. . .". The fetch algorithm includes the steps of fetching from a current 
thread as long as a miss is not predicted, and when a miss is predicted, switching to and fetching 
instructions from a second thread. 

29. Claims 16-17 and 20 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Yoaz, as applied above, in view of Ryan, U.S. Patent No. 5,694,572 (as applied in the previous 
Office Action). 

30. Referring to claim 16, Yoaz has taught a system as described in claim 1. Yoaz has 
further taught a network processor (Yoaz's processor may be use din any way including in a 
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network), but has not taught that said each of said multiple hit/miss predictors employs a flow 
number to which a packet belongs to forecast whether said corresponding instructions from said 
corresponding one of the multiple hardware streams will hit or miss the data cache. However, 
Ryan has taught the concept of operating on first process packets (instructions and data) wherein 
the cache would be populated with data specifically for the first process. Consequently, when a 
switch is made to a next process (and group of packets), the cache will not be optimized because 
it is populated with data from the first process. Therefore, the next process will result in many 
more cache misses until the cache is populated. See column 2, lines 1-16. As a result, one way a 
prediction could be made is based on process numbers or flow numbers (note that process 
numbers are well known and expected in the art as they are used to distinguish processes) 
because a new flow number would mean that the cache is not optimized for that new flow 
number and a miss is much more likely. This would be a simple way of making a prediction. As 
a result, it would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Yoaz to use flow numbers assigned to specific packets for making predictions. 
3 1 . Referring to claim 17, Yoaz has taught a processor as described in claim 6. Yoaz has 
further taught a network processor (Yoaz's processor may be used in any way including in a 
network), but has not taught that said each of said multiple hit/miss predictors employs a flow 
number to which a packet belongs to forecast whether said corresponding instructions from said 
corresponding one of the multiple hardware streams will hit or miss the data cache. However, 
Ryan has taught the concept of operating on first process packets (instructions and data) wherein 
the cache would be populated with data specifically for the first process. Consequently, when a 
switch is made to a next process (and group of packets), the cache will not be optimized because 
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it is populated with data from the first process. Therefore, the next process will result in many 
more cache misses until the cache is populated. See column 2, lines 1-16. As a result, one way a 
prediction could be made is based on process numbers or flow numbers (note that process 
numbers are well known and expected in the art as they are used to distinguish processes) 
because a new flow number would mean that the cache is not optimized for that new flow 
number and a miss is much more likely. This would be a simple way of making a prediction. As 
a result, it would have been obvious to one of ordinary skill in the art at the time of the invention 
to modify Yoaz to use flow numbers assigned to specific packets for making predictions. 
32. Referring to claim 20, Yoaz has taught a method as described in claim 1 1 . Yoaz has not 
taught that said making comprises employing a flow number to which a packet belongs to 
forecast whether the corresponding instructions from the corresponding one of the multiple 
hardware streams will hit or miss the data cache.. However, Ryan has taught the concept of 
operating on first process packets (instructions and data) wherein the cache would be populated 
with data specifically for the first process. Consequently, when a switch is made to a next 
process (and group of packets), the cache will not be optimized because it is populated with data 
from the first process. Therefore, the next process will result in many more cache misses until 
the cache is populated. See column 2, lines 1-16. As a result, one way a prediction could be 
made is based on process numbers or flow numbers (note that process numbers are well known 
and expected in the art as they are used to distinguish processes) because a new flow number 
would mean that the cache is not optimized for that new flow number and a miss is much more 
likely. This would be a simple way of making a prediction. As a result, it would have been 
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obvious to one of ordinary skill in the art at the time of the invention to modify Yoaz to use flow 
numbers assigned to specific packets for making predictions. 

Response to Arguments 

33. Applicant's arguments filed on February 22, 2005, have been fully considered but they 
are not persuasive. 

34. Applicant argues the novelty /rejection of claim 1 on page 13 of the remarks, in substance 
that: 

"It therefore does not follow that Yoaz anticipates a system for fetching instructions into a 
multi-threaded processor pipeline, for his article is directed towards execution scheduler 
limitations. Furthermore, to suggest that the teachings of Parady and Yoaz can be 
combined in a manner relative to Applicant's invention is out of place because Yoaz 
teaches how to schedule instructions for execution and Parady's invention is directed 
towards responding to the occurrence of long-latency events." 

35. These arguments are not found persuasive for the following reasons: 

a) Just because Yoaz discusses scheduling does not mean that Yoaz does not also teach 
applicant's claimed invention. More specifically, scheduling and fetching are directly related. If 
a certain thread is to be scheduled, then instructions from that thread must be fetched. Yoaz is 
concerned with switching threads on a cache miss prediction and fetching/scheduling 
instructions from a next thread. Also, that examiner has used Parady to show nothing more than 
the fact that threads may be assigned to different streams. With respect to the rejections, 
Parady's response to long-latency events does not come into play. 
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Conclusion 

36. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. Applicant is reminded that in amending in response to a rejection of claims, the 
patentable novelty must be clearly shown in view of the state of the art disclosed by the 
references cited and the objections made. Applicant must also show how the amendments avoid 
such references and objections. See 37 CFR § 1 . 1 1 1(c). 

McFarling, U WRL Technical Note TN-36: Combining Branch Predictors," 1993, has 
taught a gshare branch predictor which provides a prediction for a branch in a fetch stage. See 
Fig. 10 and note that the branch instruction address (held in the PC) is used to index a prediction 
table. This art is relevant because on page 47, column 2 of Yoaz, it is disclosed that all branch 
prediction techniques, including gshare, may also be adapted to perform hit/miss prediction. 
Consequently, predicting whether a load will miss a cache will also occur in the fetch stage as 
the load instruction's address (stored in the PC) will index the prediction table. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David J. Huisman whose telephone number is (571) 272-4168. 
The examiner can normally be reached on Monday-Friday (8:00-4:30). 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



DJH 

David J. Huisman 
May 5, 2005 




